Exploring Domain Differences for the Design of a Pronoun Resolution System for Biomedical Text
نویسندگان
چکیده
Much effort in the research community has been spent on solving the anaphora resolution or pronoun resolution problem, and in particular for news texts. In order to selectively inherit the previous works and solve the same problem for a new domain, we carried out a comparative study with three different corpora: MUC, ACE for the news texts, and GENIA for bio-medical papers. Our corpus analysis and experimental results show the significant differences in the use of pronouns in the two domains, thus by properly considering the characteristics of a domain, we can improve the performance of pronoun resolution for that domain.
منابع مشابه
Challenges in Pronoun Resolution System for Biomedical Text
This paper presents our findings on the feasibility of doing pronoun resolution for biomedical texts, in comparison with conducting pronoun resolution for the newswire domain. In our experiments, we built a simple machine learning-based pronoun resolution system, and evaluated the system on three different corpora: MUC, ACE, and GENIA. Comparative statistics not only reveal the noticeable issue...
متن کاملComparing Domain-Specific and Non-Domain-Specific Anaphora Resolution Techniques
A quantification is provided for the improvements made in traditional salience-based pronominal anaphora resolution precision when input text that has been parsed using a large scale grammar to locate syntactic function of noun phrases, is used instead of input text where more shallow syntactic analysis techniques were used for identifying grammatical function. In addition, domain-specific tech...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملDesign and Fabrication Process of MTF Phantom CT Scan
Introduction: One of the main steps in the optimization process in diagnostic imaging is the quality control of radiology devices. The usual method of CT scan calibration is used of a phantom. The phantom created a certain weakening for the radiation through which it passes. One of the most suitable methods for quantitative analysis of the resolution and contrast in CT scan im...
متن کاملDesign of Small Animal Computed Tomography Imaging for in vitro and in vivo Studies
Introduction: Mini Computed Tomography (mini-CT) was suggested in biomedical research to investigate tissues and small animals. We present designed and built a mini x-ray computed tomography (mini-CT) for small animals as well as industrial component imaging. Materials and Methods: The system used in this study includes a X-ray tube 20kV to 160kV and a flat pa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008